Award  Number:  DAMD17-02-1-0729 


AD. 


TITLE:  Use  of  a  Novel,  Stable  Gene  Silencing  Technology 

to  Determine  the  Contribution  of  the  Receptor  Tyrosine 
Kinases  to  the  Breast  Cancer  Phenotype 


PRINCIPAL  INVESTIGATOR:  Douglas  S.  Conklin,  Ph.D. 


CONTRACTING  ORGANIZATION:  Research  Foundation  of  the  State 

University  of  New  York 
Albany,  New  York  12222 


REPORT  DATE:  October  2004 


TYPE  OF  REPORT:  Annual 


PREPARED  FOR:  U.S.  Army  Medical  Research  and  Materiel  Command 
Fort  Detrick,  Maryland  21702-5012 


DISTRIBUTION  STATEMENT:  Approved  for  Public  Release; 

Distribution  Unlimited 


The  views,  opinions  and/or  findings  contained  in  this  report  are 
those  of  the  author (s)  and  should  not  be  construed  as  an  official 
Department  of  the  Army  position,  policy  or  decision  unless  so 
designated  by  other  documentation. 


r 


I  20050712  055 


REPORT  DOCUMENTATION  PAGE 


Form  Approved 
OMB  No.  074-0188 


Public  reporting  burden  lor  this  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and  maintaining 
the  data  needed,  and  completing  and  reviewing  this  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  Information,  including  suggestions  for 
reducing  this  burden  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington,  VA  22202-4302,  and  to  the  Office  of 
Management  and  Budget,  Paperwork  Reduction  Project  10704-0186).  Washington,  DC  20503 _ .  _  ■ _ . _ 


1.  AGENCY  USE  ONLY  2.  REPORT  DATE  3.  REPORT  TYPE  AND  DATES  COVERED 


(Leave  blank) 


October  2004 


Annual  (30  Sep  03  -  29  Sep  04) 


4.  TITLE  AND  SUBTITLE 

Use  of  a  Novel,  Stable  Gene  Silencing  Technology 
to  Determine  the  Contribution  of  the  Receptor  Tyrosine 
Kinases  to  the  Breast  Cancer  Phenotype 


5.  FUNDING  NUMBERS 

DAMD17-02-1-0729 


6.  AUTHOR(S) 

Douglas  S. 


Conklin,  Ph.D. 


7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Research  Foundation  of  the  State  University  of  New  York 
Albany,  New  York  12222 


E-Mail:  dconklin@albany.edu 


9.  SPONSORING  /MONITORING 

AGENCY  NAME(S)  AND  ADDRESS(ES) 

U.S.  Army  Medical  Research  and  Materiel  Command 
Fort  Detrick,  Maryland  21702-5012 


8.  PERFORMING  ORGANIZATION 
REPORT  NUMBER 


10.  SPONSORING /MONITORING 
AGENCY  REPORT  NUMBER 


12a.  DISTRIBUTION  /  AVAILABILITY  STATEMENT  12b.  DISTRIBUTION  CODE 

Approved  for  Public  Release;  Distribution  Unlimited 


13.  ABSTRACT  (Maximum  200  Words) 

An  enormous  amount  of  evidence  has  implicated  members  of  the  receptor  protein  tyrosine  kinase  (RPTK)  family  in  many  types  of 
cancer.  We  have  developed  an  extraordinarily  powerful  technique  that  uses  small,  DNA-encoded  RNAs  to  suppress  the  expression  of  a 
desired  gene  in  mammalian  cells.  For  this  project,  short  hairpin  RNAs  targeting  each  of  the  58  human  receptor  protein  tyrosine  kinases 
will  be  created ,  introduced  into  MCF-7  cells  and  tested  for  effects  on  tumorigenicity  in  vivo  and  in  vitro.  At  the  end  of  die  initial 
twelve  months  of  the  project  there  has  been  good  progress.  The  goal  of  year  one  was  to  create  a  set  of  encoded  hairpins  targeted  against 
the  genomic  complement  of  RPTKs.  This  goal  has  been  met.  By  adopting  high-throughput  approaches  to  the  construction  of  hairpins 
we  have  created  a  set  of  silencing  agents  not  only  for  the  genomic  complement  of  receptor  tyrosine  kinases,  but  also  all  tyrosine 
kinases  and  a  large  number  of  genes  functionally  related  to  the  RPTKs.  Other  developments  include  the  use  of  shRNAs  in  the 
production  of  transgenic  animals  and  microarray-based  shRNA  delivery  for  phenotypic  screening.  Both  of  these  are  potentially  useful 
in  future  studies  of  the  tyrosine  kinase  targeting  constructs. 


14.  SUBJECTTERMS 

Functional  genomics,  cell  signaling,  gene  therapy,  RNAI 


17.  SECURITY  CLASSIFICATION  18.  SECURITY  CLASSIFICATION  19.  SECURITY  CLASSIFICATION 
OF  REPORT  OF  THIS  PAGE  OF  ABSTRACT 

Unclassified  Unclassified  Unclassified 


NSN  7540-01-280-5500 


15.  NUMBER  OF  PAGES 

11 


16.  PRICE  CODE 


20.  LIMITATION  OF  ABSTRACT 


Unlimited 


Standard  Form  298  (Rev.  2-89) 
Prescribed  by  ANSI  Std.  Z39-1B 
298-102 


Table  of  Contents 


Cover . 1 

SF  298 . 2 

Table  of  Contents . 3 

Introduction . 4 

Body . 5 

Key  Research  Accomplishments . 5 

Reportable  Outcomes . N/A 

Conclusions . 5 

References . N/A 

Appendices . 6 


3 


Introduction 


Gain-of-function  mutations  and  the  amplification  of  genes  related  to  proliferation  are  universal 
components  of  cancer.  Generally,  these  mutations  account  for  the  most  profound  phenotypic 
dissimilarities  between  cancer  cells  and  the  normal  cells.  As  such  the  affected  genes  and  gene 
products  are  the  most  useful  possible  targets  for  specific  chemotherapeutics.  Among  the  genes 
most  often  overexpressed  in  cancer  are  the  receptor  protein  tyrosine  kinases  (RPTKs).  An 
enormous  amount  of  evidence  has  implicated  these  proteins  in  many  types  of  cancer.  The  HER- 
2/neu/ErbB-2  is  an  important  target  for  a  variety  of  second  generation  chemotherapeutics.  A 
growing  body  of  evidence  has  implicated  other  receptor  protein  tyrosine  kinases  in  breast 
cancer.  We  have  developed  an  extraordinarily  powerful  technique  for  suppressing  gene 
expression  in  mammalian  cells.  This  technique  uses  small,  DNA-encoded  RNAs  to  suppress  the 
expression  of  a  desired  gene,  stably  and  inexpensively  even  in  a  diploid  organism.  DNA 
encoded  short  hairpin  RNAs  targeting  each  of  the  58  human  receptor  protein  tyrosine  kinases 
will  be  cloned  into  mammalian  expression  vectors  and  introduced  into  MCF-7  cells.  The  effects 
of  the  constructs  on  RPTK  expression,  cell  viability  and  proliferation,  and  estrogen  dependence 
will  be  tested.  Hairpin  constructs  that  affect  cellular  correlates  of  tumorigenicity  will  be  tested 
further  for  effects  in  three-dimensional  culture  models.  Testing  of  these  promising  constructs  in 
subcutaneous  and  orthotopic  xenograft  models  will  also  be  carried  out.  Hairpin  RNAs  that  cause 
lethality  or  reversal  of  breast  cancer-related  phenotypes  specifically  in  cancer  cells  will  be 
identified  as  useful  tools  and  potential  therapeutics. 
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Body 


Although  good  progress  was  made  on  the  proposed  work  for  completion  by  the  end  of  twelve 
months.  The  major  goal  of  this  time  period  was:  Creation  of  a  set  of  encoded  hairpins 
targeted  against  the  genomic  complement  of  RPTKs.  This  goal  has  been  met.  As  described 
below,  by  adopting  high-throughput  approaches  to  the  construction  of  hairpins  we  have  created 
a  set  of  silencing  agents  not  only  for  the  genomic  complement  of  receptor  tyrosine  kinases,  but 
also  all  tyrosine  kinases  and  a  large  number  of  genes  functionally  related  to  the  RPTKs. 

As  was  mentioned  in  last  years  report,  I  was  recruited  to  the  State  University  of  New  York 
University  at  Albany’s  recently  established  Gen*NY*Sis  Center  for  Excellence  in  Cancer 
Genomics.  This  is  an  institution  that  focuses  on  the  type  of  work  proposed  in  the  present  award 
and  that  I  plan  to  pursue  in  my  career.  This  long  term  commitment  to  cancer  genomics  and 
improved  proximity  to  a  clinical  department  (Albany  Medical  College)  was  compelling.  That  I 
was  funded  by  the  DOD  was  instrumental  in  my  being  offered  the  job. 

Although  I  began  work  in  September  2003,  formal  transfer  of  this  award  was  a  protracted 
process.  Initiated  in  August  2003,  the  official  transfer  to  the  new  institution  and 
resumption  of  the  project  did  not  occur  until  August  31,  2004.  For  this  reason  little 
progress  was  made  in  resuming  the  proposed  work  in  the  time  covered  in  this  report. 


Key  Research  Accomplishments 
Reportable  Outcomes 


Manuscripts: 

Paddison*,  PJ,  Silva*,  JM,  Conklin*,  DS,  Schlabach,  M.,  Li,  M.,  Aruleba,  S.,  Balija,  V., 
O’Shaughnessy,  A.,  Gnoj,  L.,  Scobie,  K.,  Chang, K.,  Westbrook, T.,  Sachidanandam,  R., 
McCombie,  WR,  Elledge  SJ  and  Hannon,  GJ,  2004,  A  resource  for  large-scale  RNAi  based 
screens  in  mammals.  Nature  428,  427  -  431 

Abstracts:  NONE 

Presentations:  NONE 

Development  of  cell  lines,  tissue  or  serum  repositories:  NONE 

Funding  applied  for  based  on  work  supported  by  this  award:  NONE 

Employment  or  research  opportunities  applied  for  and/or  received  based  on 
experience/training  supported  by  this  award:  NONE 


Conclusions 


Year  1  of  funding  resulted  in  the  production  of  a  large  set  of  constructs  targeting  the  tyrosine 
kinases.  We  have  created  a  set  of  silencing  agents  not  only  for  the  genomic  complement  of 
receptor  tyrosine  kinases,  but  also  all  tyrosine  kinases  and  a  large  number  of  genes  functionally 
related  to  the  RPTKs.  This  is  an  excellent  start  to  our  ultimate  goal  of  identifying  a  small 
number  of  hairpins  that  have  inhibitory  effects  on  breast  cancer  progression  in  in  vivo  models. 

Little  has  changed  with  respect  to  Year  2  Proposed  Work  although  the  entire  project  has  been 
on  hiatus  for  a  year.  Hairpin  constructs  will  be  packaged  in  amphotropic  retroviral  packaging 
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cells.  Retroviral  supernatants  will  be  used  to  infect  estrogen  responsive  MCF-7  cells  in  six  well 
dishes.  Infected  populations  will  be  monitored  for  phenotypic  changes  related  to  tumorigenic 
potential,  e.g.  cellular  morphology,  growth  arrest,  apoptosis,  growth  in  soft  agar,  etc. 

“So  what?"  one  might  ask. 

Using  high  throughput  methodologies,  we  have  developed  a  resource  for  suppressing  gene 
expression  of  one  of  the  most  important  gene  families  in  cancer.  Although  we  are  determining 
the  effect  that  suppressing  tyrosine  kinase  gene  expression  has  on  the  tumorigenicity  of  breast 
cancer  cells,  it  is  hoped  that  the  hairpin  constructs  will  serve  as  a  resource  for  other 
investigators  studying  a  variety  of  cancers.  This  ultimately  will  add  to  our  understanding  of  the 
genes  encoding  potential  small  molecule  therapeutic  targets  in  breast  and  other  cancers.  It 
should  also  pave  the  way  for  testing  of  the  hairpins  that  we  identify  as  gene  therapeutics. 


Appendix:  Paddison*,  PJ,  Silva*,  JM,  Conklin*,  DS,  Schlabach,  M„  Li,  M„  Aruleba,  S„  Balija, 
V.,  O’Shaughnessy,  A.,  Gnoj,  L„  Scobie,  K„  Chang.K.,  Westbrook, T„  Sachidanandam,  R„ 
McCombie,  WR,  Elledge  SJ  and  Hannon,  GJ,  2004,  A  resource  for  large-scale  RNAi  based 
screens  in  mammals.  Nature  428,  427  -  431 
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A  resource  for  large-scale 
RNA-interference-based 
screens  in  mammals 

Patrick  J.  Paddlson'*,  Jose  M.  Silva'*,  Douglas  S.  Conklin ut, 

Mike  Schlabach2f,  Mamie  LI2,  Shola  Anileba',  Vivekanand  Balija1, 

Andy  O’Shaughnessy',  Lidia  Gno]’,  Kim  Scobie',  Kenneth  Chang1, 
Thomas  Westbrook2!,  Michele  Cleary3,  Ravi  Sachldanandam', 

W.  Richard  McCombie1,  Stephen  J.  Elledge  t  &  Gregory  J.  Hannon1 

1 Cold  Spring  Harbor  Laboratory,  Watson  School  of  Biological  Sciences, 

1  Bungtown  Road,  Cold  Spring  Harbor,  New  York  11724,  USA 

2  Department  of  Biochemistry,  Howard  Hughes  Medical  Institute,  Baylor  College  of 
Medicine,  One  Baylor  Plaza,  Houston,  Texas  77030,  USA 

3 Rosetta  Inphartnatics,  12040  115th  Avenue  NE,  Kirkland,  Washington  98034, 
USA 

*These  authors  contributed  equally  to  this  work 

t  Present  addresses:  Department  of  Biomedical  Sciences,  Center  for  Functional  Genomics,  University  at 
Albany,  East  Campus,  B342A,  One  University  Place,  Rensselaer,  New  York  12144-2345,  USA  (D.S.C.); 
Department  of  Genetics,  Harvard  Partners  Center  for  Genetics  and  Genomics,  Harvard  Medical  School 
Room  158D,  NRB,  77  Avenue  Louis  Pasteur,  Boston,  Massachusetts  02115,  USA  (M.S.,  T.W.  and  SJ.E.) 

Gene  silencing  by  RNA  interference  (RNAi)  in  mammalian  cells 
using  small  interfering  RNAs  (siRNAs)  and  short  hairpin  RNAs 
(shRNAs)  has  become  a  valuable  genetic  tool1-1".  Here,  we  report 
the  construction  and  application  of  a  shRNA  expression  library 
targeting  9,610  human  and  5,563  mouse  genes.  This  library  is 
presently  composed  of  about  28,000  sequence-verified  shRNA 
expression  cassettes  contained  within  multi-functional  vectors, 
which  permit  shRNA  cassettes  to  be  packaged  in  retroviruses, 
tracked  in  mixed  cell  populations  by  means  of  DNA  ‘bar  codes’, 
and  shuttled  to  customized  vectors  by  bacterial  mating.  In  order 
to  validate  the  library,  we  used  a  genetic  screen  designed  to  report 
defects  in  human  proteasome  function.  Our  results  suggest  that 
our  large-scale  RNAi  library  can  be  used  in  specific,  genetic 
applications  in  mammals,  and  will  become  a  valuable  resource 
for  gene  analysis  and  discovery. 

In  invertebrates,  RNAi  has  been  harnessed  as  a  powerful  genetic 


tool  to  reduce  endogenous  gene  expression  through  programming 
organisms  or  cells  with  homologous  double-stranded  (ds)RNA 
(reviewed  in  ref.  1).  In  Caenorhabditis  elegans ,  this  approach  has 
been  used  in  large-scale,  genome-wide  screens2-5.  With  the  advent  of 
RNAi  in  mammals  and  the  refinement  of  techniques  to  trigger  gene 
silencing,  expression  from  any  human  or  mouse  transcript  can,  in 
principle,  be  inhibited  using  siRNAs  or  shRNAs. 

To  facilitate  the  use  of  RNAi  as  a  genetic  tool  in  mammals,  we 
have  constructed  a  large-scale  library  of  RNAi-inducing  shRNA 
expression  vectors  targeting  human  and  mouse  genes.  We  began  by 
testing  a  number  of  variables  that  might  affect  shRNA  performance. 
We  previously  showed  that  shRNAs  containing  29  nucleotides  of 
dsRNA  and  simple  loop  structures  are  effective  silencing  triggers 
when  expressed  from  U6  small  nuclear  RNA  promoters6,7.  Other 
published  accounts  of  shRNA-mediated  inhibition  include  differ¬ 
ences  in  promoter  choice  (reviewed  in  ref.  8),  orientation  of  sense 
and  antisense  strands,  length  of  RNA  duplex  (19-29  nucleotides), 
loop  structure,  and  the  addition  of  a  27-nucleotide  U6  leader 
sequence9.  After  comparing  each  of  these  variables  with  a  series  of 
eight  shRNAs  targeting  firefly  luciferase  or  green  fluorescent  protein 
(GFP),  we  found  that  RNA  polymerase  III  promoters  were  largely 
interchangeable  (Supplementary  Fig.  la,  b),  that  29-nucleotide 
hairpins  were  more  effective  than  shorter  shRNAs  (Supplementary 
Fig.  la  and  data  not  shown),  that  changes  in  loop  structure  had 
minimal  effects  (data  not  shown),  and  that  the  addition  of  the  U6 
leader  sequence  had  a  positive  effect  (Supplementary  Fig.  lc).  The 
finalized  hairpin  design  for  this  library  is  presented  in  Supplemen¬ 
tary  Fig.  2a.  It  contains  a  27-nucleotide  U6  leader  sequence, 
followed  by  29  base  pairs  (bp)  of  dsRNA  and  a  4-nucleotide  loop. 
Recent  studies  have  suggested  that  each  shRNA  is  processed  from  its 
stem  end  by  Dicer  through  a  single  cleavage  event  (D.  Siolas  and 
G.J.H.,  unpublished  data).  Thus,  the  combination  of  Drosha  and 
Dicer  processing  is  predicted  to  generate  a  precisely  defined  siRNA 
from  each  29-nucleotide  shRNA  expression  vector. 

shRNAs  were  designed  covering  approximately  10,000  human 
and  5,000  mouse  genes  with  between  three  and  nine  constructs  each 
(Supplementary  Fig.  2b).  Only  coding  sequences  were  targeted,  and 
each  shRNA  was  chosen  such  that  it  contained  >3  mismatches  to 
any  other  gene.  Where  possible,  shRNAs  had  sequence  identity  to 
the  mouse  orthologue  of  the  targeted  gene.  In  pilot  experiments, 
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Figure  1  pSHAG-MAGIC  shRNA  cassette  movement  strategy,  a,  Map  of  the  pSHAG- 
MAGIC  vector.  CM',  chloramphenicol-resistance  gene;  Kan',  kanamycin-resistance  gene; 
LTR.  long  terminal  repeat;  OriT,  origin  of  transfer,  b,  A  diagrammatic  representation  of 
DNA  exchanges  occurring  once  the  pSHAG-MAGIC  donor  vector  has  been  transferred  to 
cells  containing  a  recipient  vector  by  mating.  In  this  case  pLenti-LoxP  (M.L.  and  S.J.E., 
unpublished  data)  is  the  recipient  vector.  WPRE,  woodchuck  hepatitis  B  post- 
transcriptional  responsive  element,  c,  Plasmids  from  ten  independent  colonies  (post¬ 


mating)  were  digested  with  AWel  and  the  digestion  products  were  separated  on  a  0.5% 
agarose  gel  (lanes  1-10).  The  parental  plasmids,  pSHAG-MAGIC  (lane  11)  and 
pLenti-LoxP  (lane  1 2),  each  contain  a  single  Mfel  site,  and  on  cleavage  generate  a  7.0-  or 
10.6-kilobase  (kb)  fragment,  respectively.  The  pSHAG-MAGIC  vector  contains  a  unique 
Ndd  site  in  the  U6  promoter,  which  is  transferred  into  the  recipient  vector.  The  correct 
mating  product  generates  two  fragments  of  3.5  kb  and  7.4  kb.  M,  marker. 
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between  25-75%  of  cloned  shRNAs  contained  significant  mutations, 
which  arose  during  chemical  synthesis.  As  a  result  we  sequence- 
verified  each  shRNA  in  the  library  (see  Supplementary  Fig.  2b). 

Our  current  sequence- verified  library  has  a  representation  of 
9,610  human  and  5,563  mouse  genes.  This  corresponds  to  28,659 
shRNAs  targeting  human  genes  and  9, 1 19  shRNAs  targeting  mouse 
genes.  Examples  of  library  coverage  for  selected  functional  cat¬ 
egories  of  human  genes  are  presented  in  Supplementary  Table  1. 
The  most  thoroughly  represented  functional  groups  in  our  current 
library  are  kinases  and  phosphatases,  in  which  approximately  85% 
of  all  known  kinases  and  phosphatases  are  targeted  by  three  or  more 
hairpins.  Most  other  functional  classes  contain  between  30-60% 
coverage  with  three  or  more  hairpins,  and  >80%  of  genes  in 
functional  categories  listed  in  Supplementary  Table  1  are  targeted 
by  at  least  one  sequence-verified  hairpin. 

The  shRNA  expression  library  has  been  constructed  in  a 
vector  that  contains  a  number  of  convenient  design  features 
(pSHAG-MAGIC,  Fig.  la).  The  vector  is  capable  of  producing 
self-inactivating  murine-stem-cell-virus  (MSCV)  particles  in 
commonly  available  retroviral  packaging  lines.  We  have  also  incor¬ 
porated  a  new  in  vivo  subcloning  technology.  This  method  is  called 
‘Mating-Assisted  Genetically  Integrated  Cloning’  (MAGIC)  (M.L. 
and  S.J.E.,  manuscript  in  preparation).  The  MAGIC  system  consists 
of  a  donor  vector  (the  library  vector),  in  which  the  fragment  of 


interest  is  flanked  by  two  different  50-bp  homology  regions,  HI  and 
H2,  which  in  turn  are  flanked  with  linked  1-SceI  sites.  The  donor 
vector  also  includes  an  F'  origin  and  a  conditional  origin  of 
replication  (RK6y;  Fig.  la,  b).  The  recipient  vector,  which  also 
contains  I-Scel-linked  HI  and  H2  sites  surrounding  a  negative 
selectable  marker  ( pheS ),  resides  in  a  bacterial  strain  that  contains 
an  inducible  I-Scel  gene  (Fig.  lb).  After  transfer  of  the  donor  vector 
into  the  recipient  host  by  bacterial  mating,  I-Scel  cleaves  both  donor 
and  recipient  vectors,  and  these  breaks  are  healed  by  homologous 
recombination  via  the  HI  and  H2  sequences.  Selection  against  the 
unrecombined  recipient  containing  pheS  and  I-Scel  sites  and  for  the 
capture  of  the  appropriate  insert  (chloramphenicol  resistance)  gives 
essentially  100%  recovery  of  the  desired  plasmid.  To  test  MAGIC 
transfer  of  shRNA  clones,  we  developed  a  lentivirus  recipient  vector 
based  on  the  FUW  vector10.  Using  a  test  shRNA  clone  in  the  mating, 
we  observed  a  mating  efficiency  of  >6  X  106  clones  per  ml  of  mixed 
bacteria.  Restriction  analysis  showed  that  10  out  of  10  recombinants 
have  the  expected  structure  (Fig.  lc). 

The  shRNA  library  was  designed  to  function  for  both  genetic 
selections  and  screens.  For  selections  conferring  a  growth  advan¬ 
tage,  the  library  can  be  used  in  a  pooled  fashion.  Genetic  screens  (for 
example,  for  lethal  events)  can  be  carried  out  in  a  96-well  format; 
however,  this  method  is  labour  intensive  and  time  consuming.  To 
facilitate  such  screens,  we  have  adopted  a  DNA  bar-coding  strategy 
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Figure  2  Microarray  analysis  of  pSHAG-MAGIC  library  bar  codes,  a,  Self-self 
hybridization  of  pSHAG-MAGIC  library  bar  codes  obtained  from  approximately  15,000 
library  plasmids  prepared  from  E  coli.  The  DNA  microarray  (Agilent  Technologies)  Is 
composed  of  20,241  complementary  60-nuclotide  oligonucleotides  out  of  a  total  of 
22,575  elements  on  the  array,  including  controls;  the  remaining  represent  various 
positive  and  negative  controls.  Cy-labelled  cRNA  for  library  bar  codes  was  generated  and 
competitively  hybridized  as  described  in  the  Supplementary  Methods,  b,  An  analysis  of  a 
subset  of  255  60-nucleotide  bar  codes  from  a  versus  255  60-nucleotide  shRNA  probes. 
Each  60-nucleotide  shRNA  probe  contains  a  direct  repeat  of  the  29-nucleotide  gene- 
targeting  sequence  with  a  2-nucleotide  spacer,  c,  Microarray  analysis  of  pSHAG-MAGIC 
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bar  codes  from  transduced  NIH3  T3  cells.  Two  pools  of  2  x  107  cells  (A  and  B)  were 
infected  separately  and  harvested  48  h  after  infection.  Bar  codes  were  processed  and 
competitively  hybridized  as  described  in  the  Methods.  The  xaxis  is  an  index  of  all  the 
elements  (22,575),  bar  codes,  hairpins  and  controls  on  the  array.  The  y  axis  shows  the 
mean  measured  ratios  from  two  experiments  (forward  and  reverse  colour  orientation)  of 
3T3  pool  A  hybridized  against  3T3  pool  B,  plotted  on  a  log  scale  with  the  numbers 
indicating  fold  change.  Most  probes,  with  the  exception  of  a  few,  scatter  around  1 .0  (no 
change)  with  a  range  between  0.5  (twofold  decrease)  and  2.0  (twofold  increase), 
d,  Microarray  analysis  of  pSHAG-MAGIC  bar  codes  in  transduced  normal  human  diploid 
fibroblasts  (IMR90)  cells  carried  out  as  in  c. 
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that  has  been  used  previously  in  Saccharomyces  cerevisiae  deletion 
collections,  to  follow  individual  mutants  in  complex  populations 
via  microarray  analysis11"13.  We  have  linked  a  unique  60-nuclotide 
DNA  bar  code  to  each  shRNA  vector  to  allow  us  to  follow  the  fate  of 
shRNAs  in  populations  of  virally  transduced  cells.  To  monitor 
relative  frequencies  of  individual  shRNA  species,  we  designed 
microarrays  (Agilent  Technologies)  containing  20,241  60-nucleo¬ 
tide  DNAs  complementary  to  each  shRNA  bar  code. 

Sequence  analysis  of  library  clones  revealed  that  a  fraction  (about 
4%)  contained  orphan  shRNAs  without  bar  codes.  We  therefore 
sought  to  determine  whether  we  could  use  the  shRNA  sequence  as  a 
bar  code.  We  chose  255  non-orphan  shRNAs  and  deposited  on 
arrays  both  associated  bar  codes  and  oligonucleotides  designed  have 
60  hybridize  to  the  shRNA  itself.  Such  oligonucleotides  were 


designed  nucleotides  in  order  to  contain  direct  repeats  of  the  29- 
nucleotide  antisense  sequence.  Cy-labelled  antisense  RNA  was 
generated  from  about  15,000  library  constructs  such  that  it  con¬ 
tained  the  reverse  complement  of  the  bar  code  and  the  entire 
shRNA.  A  linear  log-log  plot  of  a  self  hybridization  revealed 
consistent  cRNA  production  and  hybridization  from  the  plasmid 
library  (Fig.  2a).  However,  hairpins  gave  substantially  lower  hybrid¬ 
ization  signals,  essentially  at  background  levels,  relative  to  the 
corresponding  bar  codes  (Fig.  2b),  indicating  that  using  hairpins 
as  probes  is  not  an  optimal  bar-coding  strategy  under  these 
conditions. 

We  next  asked  whether  bar  codes  could  be  used  to  report  the 
representation  of  individual  shRNAs  in  transduced  mammalian 
cells.  Normal  human  diploid  fibroblasts  (IMR90)  or  NIH3T3  cells 


pSHAG*MAG*C  library  construe! 


Figure  3  A  reverse  genetic  screen  for  defects  in  human  proteasome  function,  a.  A  graph 
of  relative  ZsGreen  fluorescence  for  all  pSHAG-MAGIC  clones  transfected.  Red  vertical 
bars  indicate  22  positively  scoring  proteasome  shRNAs  corresponding  to  15  known 
proteasome  subunits,  whereas  black  bars  indicate  library  shRNAs.  The  yellow  horizontal 
bar  Indicates  a  cut-off  that  was  set  based  on  control  experiments,  b,  A  replicate 
transfection  experiment  as  in  a  using  a  set  of  22  positively  scoring  proteasome  hairpins 
and  33  that  failed  to  score  (blue  bars),  along  with  non-proteasome  hairpin  controls  (green 
bars).  These  experiments  were  carried  out  in  triplicate.  All  22  of  the  proteasome  hairpins 
with  a  positive  score  from  the  original  screen  were  re-tested  and  also  achieved  a  positive 
score.  None  of  the  33  non-scoring  proteasome  hairpins  from  the  original  screen  managed 
to  match  the  score  of  the  22  positively  scoring  proteasome  shRNAs  from  the  first  round 
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(yellow  line).  However,  in  this  experiment  36  of  55  scored  well  above  (~2  s.d.)  the  mean 
background  (red  line),  c,  Fluorescence  microscopy  images  showing  representative  results 
for  individual  proteasome  hairpins  as  carried  out  in  a  and  b.  A  shRNA  targeting  firefly 
luciferase  (Ff-1)  and  small  molecule  proteasome  inhibitor,  MG-1 32  (Sigma),  were  used  as 
negative  and  positive  controls,  respectively,  d,  A  diagrammatic  representation  of  the  26S 
proteasome  colour-coded  according  to  pSHAG-MAGIC  library  hits.  Subunits  coloured 
green  had  strong  positive  shRNA  hits  from  the  library  in  the  primary  screen.  Grey  subunits 
were  not  represented  by  any  shRNAs  in  the  approximately  7,000  tested.  Subunits 
coloured  blue  were  represented  by  at  least  1  shRNA  but  did  not  score  in  the  screen.  For 
nomenclature  see  Supplementary  Table  2. 
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were  infected  with  retroviruses  derived  from  the  library  population 
that  was  used  for  control  hybridizations.  To  test  for  experimental 
variability,  two  populations  of  2  X  107  cells  (denoted  A  and  B)  were 
infected  independently  at  a  multiplicity  of  infection  of  approxi¬ 
mately  1.  Examination  of  colour-reversal  experiments  and  plots 
comparing  the  relative  intensity  of  the  bar-code  signals  in  these 
populations  (Figs  2c,  d)  indicated  that  all  steps  of  the  procedure 
were  highly  reproducible,  with  the  ratio  of  intensities  varying  by  less 
than  twofold  in  all  but  a  few  isolated  cases  for  each  cell  line. 

Finally,  we  sought  to  test  the  performance  of  the  shRNA  library  in 
a  biological  context.  We  focused  on  an  assay  for  which  we  could 
predict  the  recovery  of  a  substantial  number  of  shRNAs  targeting 
genes  in  a  known  biological  pathway.  The  26S  proteasome  is  the 
major  non-lysosomal  protease  in  eukaryotic  cells.  To  search  the 
library  for  shRNAs  that  compromise  proteasome  function,  we  used 
a  reporter  assay  in  which  a  fluorescent  protein  is  coupled  to  a  well- 
characterized  degradation  signal.  The  mouse  ornithine  decarboxy¬ 
lase  (MODC)  gene  contains  a  PEST  sequence  that  directs  protea- 
somal  degradation  without  the  need  for  ubiquitination1"1.  Roughly 
7,000  shRNA  expression  plasmids,  corresponding  to  approximately 
one-quarter  of  the  complete  library,  were  individually  co-trans- 
fected  with  two  expression  constructs.  The  first  encoded  a  Zoanthus 
green  fluorescent  (ZsGreen)-MODC  degron  fusion.  In  cases  in 
which  shRNAs  compromised  proteasome  function,  ZsGreen- 
MODC  was  expected  to  accumulate,  giving  a  detectable  signal. 
The  second  plasmid  encoded  Discosoma  red  fluorescent  protein 
(DsRed).  This  permitted  normalization  of  signals,  which  controlled 
for  transfection  efficiency. 

An  analysis  of  6,712  shRNAs  targeting  4,873  genes  revealed 
approximately  100  RNAi  constructs  that  increased  the  accumu¬ 
lation  of  ZsGreen-MODC  (Fig.  3a).  Twenty-two  of  these  corre¬ 
sponded  to  15  known  proteasome  subunits.  As  a  secondary  test, 
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Figure  4  Further  validation  of  selected  pSHAG-MAGIC  proteasome  hairpins,  a,  Western 
blot  showing  specific  suppression  of  the  ATPase-1  and  non-ATPase-1  of  the  19S 
regulatory  subunit  in  transiently  transfected  HEK293  cells.  Cells  were  transfected  with 
shRNAs  as  indicated.  Knockdown  of  protein  levels  for  shRNA-1  and  -3  against  ATPase-1 
proteasome  correlated  with  the  severity  of  the  relative  scores  in  the  pZsGreen-MODC 
accumulation  assay.  The  lane  labelled  untransfected  indicates  the  control  lane  where 
cells  were  not  transfected,  b,  A  western  blot  showing  increased  steady-state  levels  of 
endogenous  c-Myc  in  HEK293  cells  transiently  transfected  with  library  shRNAs  as 
indicated.  c-Myc  is  normally  degraded  by  ubiquitin-mediated  proteolysis  in  these  cells'9. 


these  22  putative  proteasome-positive  shRNAs,  and  an  additional 
33  shRNAs  that  targeted  proteasome  subunits  but  which  had  not 
scored  in  the  original  screen,  were  selected  from  the  population. 
These  were  assayed  in  replicate  transfections  for  the  ability  to 
increase  the  stability  of  the  unstable  fluorescent  protein.  Again, 
the  22  shRNAs  scored  positively,  whereas  the  33  proteasome 
shRNAs  not  detected  in  the  initial  screen  scored  less  well.  However, 
an  additional  14  shRNAs  did  score  above  background  in  the  focused 
assay  (Fig.  3b,  c). 

Notably,  many  of  the  positive  shRNAs  targeted  19S  base  subunits, 
including  five  out  of  five  ATPases  and  the  two  largest  non-ATPases 
(1  and  2)  (Fig.  3c,  d;  see  also  Supplementary  Table  2).  Compared 
with  the  19S  base,  targeting  subunits  of  the  19S  lid  or  the  20S  core 
produced  a  lower  hit  rate.  However,  two  out  of  three  of  the  most 
important  catalytic  components,  located  in  subunits  (3 1  (peptidyl- 
glutamyl  hydrolysing  or  caspase-like)  and  35  (chymotrypsin- 
like)15’16,  achieved  a  positive  score  in  our  assay  (Fig.  3).  The  other 
two  3-subunits  that  we  identified  are  involved  in  important  cis 
contacts  between  neighbouring  subunits  (33)  and  in  trans  contacts 
between  the  two  3-rings  (37)'7,18.  In  all  cases  tested,  activation  of  the 
reporter  was  accompanied  by  a  reduction  in  the  expression  of 
targeted  proteasomal  components  (Fig.  4a  and  data  not  shown). 

The  c-Myc  oncoprotein  is  a  target  for  ubiquitin-mediated  degra¬ 
dation  by  the  proteasome19.  As  was  observed  for  ZsGreen-MODC 
degron  and  bulk-ubiquitinated  protein  (not  shown),  c-Myc  levels 
specifically  increased  upon  transfection  of  cells  with  shRNAs  that 
targeted  proteasomal  subunits  (Fig.  4b). 

Here,  we  report  progress  towards  the  construction  of  a  genome¬ 
wide  library  of  RNAi-inducing  constructs  for  use  in  mammalian 
cells.  At  present  the  resource  targets  approximately  10,000  human 
and  5,000  murine  genes,  and  it  continues  to  expand.  To  examine  the 
performance  of  the  library,  we  have  tested  about  one-quarter  of  its 
constituent  clones  (roughly  7,000  shRNA  expression  vectors)  indi¬ 
vidually  for  the  ability  to  inhibit  the  degradation  of  a  direct 
proteasome  target.  Nearly  50%  of  the  shRNAs  in  this  collection 
that  were  expected  to  target  proteosomal  proteins  were  recovered  as 
positives.  The  availability  of  this  resource  to  the  research  commu¬ 
nity  will  open  the  door  to  the  use  of  RNAi  in  mammalian  systems  as 
a  large-scale  tool  for  biological  discovery.  □ 


Methods 

A  detailed  protocol  for  the  design  and  creation  of  shRNA  polymerase  chain  reaction 
(PCR)  cassettes  is  available  at  http://www.cshl.edu/public/SCIENCE/hannon.html. 
Oligonucleotides  were  purchased  from  Sigma-Genosys.  The  pSHAG-MAGIC  shRNA 
library  was  cloned  in  the  MAGIC-competent  Escherichia  coli  strain  BW23474F'DOT 
(M.L.  and  S.J.E.,  manuscript  in  preparation).  The  full  sequence  of  the  pSHAG-MAGIC 
vector  and  details  of  the  mating  protocol  are  available  for  download  at  the  shRNA  library 
website  (http://www.cshl.edu/public/SCIENCE/hannon.html).  Bar-code  microarrays 
were  synthesized  by  Agilent  Technologies  and  were  analysed  as  described  in  the 
Supplementary  Methods. 

Plasmids  were  prepared  as  described  in  the  Supplementary  Methods.  Transfections  were 
carried  out  in  96-well  plates  using  plasmid  mixtures  described  in  Supplementary  Methods. 
Fluorescence  signals  were  read  on  a  Victor2  plate  reader.  Signals  in  the  green  channel  were 
normalized  to  transfection  efficiency  using  customized  scripts  with  fluorescence  in  the  red 
channel  serving  as  a  normalization  criterion.  Cut-offs  were  assigned  by  using  16 
independent  control  shRNA  transfections  to  determine  the  range  for  a  negative  outcome. 
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RNA  interference  (RNAi)  is  a  powerful  new  tool  with  which  to 
perform  loss-of-function  genetic  screens  in  lower  organisms  and 
can  greatly  facilitate  the  identification  of  components  of  cellular 
signalling  pathways1-3.  In  mammalian  cells,  such  screens  have 
been  hampered  by  a  lack  of  suitable  tools  that  can  be  used  on  a 
large  scale.  We  and  others  have  recently  developed  expression 


vectors  to  direct  the  synthesis  of  short  hairpin  RNAs  (shRNAs) 
that  act  as  short  interfering  RNA  (siRNA)-like  molecules  to  stably 
suppress  gene  expression4,5.  Here  we  report  the  construction  of  a 
set  of  retroviral  vectors  encoding  23,742  distinct  shRNAs,  which 
target  7,914  different  human  genes  for  suppression.  We  use  this 
RNAi  library  in  human  cells  to  identify  one  known  and  five  new 
modulators  of  p53-dependent  proliferation  arrest.  Suppression 
of  these  genes  confers  resistance  to  both  p53-dependent  and 
p/94Rf-dependent  proliferation  arrest,  and  abolishes  a  DNA- 
damage-induced  G1  cell-cycle  arrest.  Furthermore,  we  describe 
siRNA  bar-code  screens  to  rapidly  identify  individual  siRNA 
vectors  associated  with  a  specific  phenotype.  These  new  tools 
will  greatly  facilitate  large-scale  loss-of-function  genetic  screens 
in  mammalian  cells. 

RNAi  is  a  defence  mechanism  triggered  by  double-stranded 
(ds)RNAs  to  protect  cells  from  parasitic  nucleic  acids.  The  dsRNAs 
are  processed  into  siRNAs,  which  target  homologous  RNAs  for 
destruction6.  In  mammalian  cells,  an  RNAi  response  can  be  trig¬ 
gered  by  21 -base-pair  siRNAs,  which  can  cause  strong,  but  transi¬ 
ent,  inhibition  of  gene  expression7.  By  contrast,  vector- expressed 
shRNAs  can  suppress  gene  expression  over  prolonged  periods4,5.  In 
the  current  study,  we  create  a  large  set  of  vectors  and  use  them  to 
search  for  components  of  the  p53  tumour-suppressor  pathway.  This 
pathway  is  crucial  for  genome  integrity  as  it  transmits  both  anti¬ 
proliferative  and  pro-apoptotic  signals  in  response  to  a  variety  of 
stress  signals*. 

To  construct  a  human  RNAi  library  (the  ‘NKi  library’),  we 
selected  7,914  human  genes  for  shRNA-mediated  reduction  in 
expression,  known  as  knockdown.  This  collection  of  genes  includes 
components  of  major  cellular  pathways,  including  the  cell  cycle, 
transcription  regulation,  stress  signalling,  signal  transduction  and 
important  biological  processes  such  as  biosynthesis,  proteolysis  and 
metabolism.  In  addition,  genes  implicated  in  cancer  and  other 
diseases  are  included  in  the  library  (see  Supplementary  Table  1). 
To  increase  the  likelihood  of  obtaining  a  significant  inhibition  of 
gene  expression,  we  constructed  three  different  shRNA  vectors 
against  each  gene  (23,742  vectors  in  total;  Fig.  la  and  Supplemen¬ 
tary  Fig.  1).  The  oligonucleotides  specifying  the  shRNAs  were 
annealed  and  cloned  in  a  high-throughput  fashion  into  pRetroSu- 
per  (pRS),  a  retroviral  vector  that  contains  the  shRNA  expression 
cassette'*.  Using  a  pool  of  three  knockdown  vectors  against  a  single 
gene,  we  obtain  on  average  70%  inhibition  of  expression  for 
approximately  70%  of  the  genes  in  the  library  (see  ref.  10;  data 
not  shown).  The  vector-based  shRNA  library  can  be  used  for 
functional  genetic  screens  in  both  short-term  and  long-term  assays 
using  DNA  transfection  or  retroviral  transduction. 

To  validate  the  RNAi  library,  we  developed  a  cell  system  to  screen 
for  bypass  of  p53-dependent  proliferation  arrest.  We  generated 
primary  human  BJ  fibroblasts,  which  ectopically  express  the  murine 
ecotropic  receptor,  the  telomerase  catalytic  subunit  (TERT)  and  a 
temperature-sensitive  allele  of  SV40  large  T  antigen  (tsLT),  yielding 
BJ-TF.RT-tsLT  cells.  As  can  be  seen  in  Fig.  lb,  these  cells  proliferate 
when  grown  at  32  °C,  the  temperature  at  which  tsLT  binds  and 
inactivates  both  retinoblastoma  protein  (pRB)  and  p53,  but  enter 
into  a  synchronous  proliferation  arrest  after  a  shift  to  39  °C,  at 
which  tsLT  is  inactive.  To  determine  whether  this  proliferation 
arrest  is  p53  dependent,  we  infected  BJ-TERT-tsLT  cells  with 
pRS-p53  (which  targets  p53  for  suppression)  at  32  °C,  and  shifted 
them  to  39  °C  after  two  days.  Figure  lb  shows  that  knockdown  of 
p53  allowed  temperature-shift-induced  proliferation  arrest  to  be 
bypassed.  Knockdown  of  the  RB  pathway  component  pl6,NK4A 
alone  did  not  allow  growth  arrest  to  be  bypassed,  but  simultaneous 
suppression  of  both  pl6,NK4A  and  p53  yielded  a  further  increase  in 
escape  from  growth  arrest  compared  with  knockdown  of  p53  alone 
(Fig.  lb).  Thus,  the  conditional  proliferation  arrest  in  BJ  fibroblasts 
depends  primarily  on  p53. 

We  isolated  polyclonal  plasmid  DNA  from  each  of  the  83  pools  of 
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